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The maximum likelihood strategy to the estimation of group parameters allows to derive 
■ in a general fashion optimal measurements, optimal signal states, and their relations with 

' other information theoretical quantities. These results provide a deep insight into the 

general structure underlying optimal quantum estimation strategies. The entanglement 
between representation spaces and multiplicity spaces of the group action appear to be 
the unique kind of entanglement which is really useful for the optimal estimation of 
group parameters. 
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1. Introduction 

Since the beginning of quantum estimation theory 1,2 , the research about measure- 
ments and estimation strategies for the optimal detection of physical parameters 
has been a major focus. In particular, the case where the physical parameters to 
be estimated correspond to the unknown action of some symmetry group has re- 
ceived a constant attention 3 ' 4 ' 5 . The reason of such interest, that in the last ten 
years received a strong motivation from the new field of quantum information, is 
the broad spectrum of applications of the topic, ranging from quantum statistics 
to quantum cryptography, from the study of uncertainty relations to the design of 
high sensitivity measurements achieving the ultimate quantum limit. 



* chiribcllaOfisicavolta. unipv. it 

tdariano@unipv.it 

t perinottiOfisicavolta. unipv. it 

§ msacchi@unipv.it 

1 



I 



1, 2008 10:2 WSPC/INSTRUCTION FILE MLreview 



2 G. Chiribella, G. M. D'Ariano, P. Perinotti, and M. F. Sacchi 

Despite the long-dated attention to the problem, many new examples and ap- 
plications of group parameter estimation came up recently, and, on the other hand, 
some controversial points have been clarified only in the very last years. In the rich 
variety of this scenario, it is somehow natural to look for a general point of view, 
suitable for capturing the main features of optimal estimation without entering the 
specific details of the symmetry group involved in the particular problem at hand. 

In this paper, we will show how such kind of general insight can be provided in 
a simple way in the maximum likelihood approach 6 , where the measurements are 
designed to maximize the probability of estimating the true value of the unknown 
group parameter. In this approach, the relations between the quality of the estima- 
tion and other information theoretic properties, such as the Holevo x~ quantity 7 
and the dimension of the space spanned by a quantum state under the action of 
the group become straightforward. 

The results of the maximum likelihood method recently allowed to understand 
the crucial role of the equivalent irreducible representations of the group in the 
optimal estimation strategy 8 , giving a striking application of this mechanism with 
the solution of a long-standing controversy 9 about the efficiency in the absolute 
transmission of a Cartesian reference frame. In that application, the technique of 
equivalent representations has been the key idea for an efficient use of quantum 
resources. 

Here we will show in a general fashion that, far from being a technicality, the 
use of equivalent representations is synonymous of the use of a unique kind of 
entanglement that is really suitable for group parameter estimation. More precisely, 
such kind of entanglement is the entanglement between spaces where the group 
acts irreducibly (representation spaces) and spaces where the group acts trivially 
(multiplicity spaces). This entanglement is related only to the group, without any 
relation with other "natural" tensor product structures that can be present in the 
system, e.g. when the system is made by a set of distinguishable particles. 

The concept of representation/multiplicity is well known in the field of 
quantum error correction 10 ' 11 , where the multiplicity spaces are referred to as 
decoherence-free subspaces and, more generally, noiseless subsystems. Moreover, 
the same concept has recently found applications also in the context of quan- 
tum communication 12 and cryptography 13 . On the other hand, the application to 
quantum estimation of tools such as representation/multiplicity spaces and entan- 
glement is a completely new issue 8 ' 14 . In the maximum likelihood approach the 
maximal entanglement between representation and multiplicity spaces appears to 
be the unifying feature of the optimal strategy for any group parameter estimation. 
Moreover, the validity of this result is not limited to the maximum likelihood esti- 
mation, and can be extended to estimation schemes where different figures of merit 
are considered 15 . 

In Section 2 wc will present the general approach to group parameter estimation. 
We will firstly start with a brief self-contained introduction about group parameter 
estimation (2.1), maximum likelihood approach (2.2), group theoretical tools (2.3), 
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and covariant measurements (2.3). After that, we will derive in the general setting 
the optimal measurements (2.5), and the optimal states (2.6), emphasizing the role 
of equivalent representations and the relations with the Holcvo \— quantity. Finally 
we will conclude the section with a discussion (2.7) about the use of entanglement 
with an additional reference system, showing how to efficiently use such resource. 

In Section 3, all the results will be generalized to infinite dimension (Subsec- 
tion 3.1) and non-compact groups (Subsection 3.2). As an example of application, in 
Subsection 3.3 we provide a detailed analysis of the estimation of the joint displace- 
ment of two harmonic oscillators in the phase space, underlying in this continuous 
variable context the connections with the well known example by Gisin and Popescu 
about the estimation of a direction using parallel and anti-parallel spins 18 . 

2. A general approach to group parameter estimation 
2.1. Background 

The issue of this paper is the problem of optimally discriminating a family of 
quantum signal states, which is the orbit generated by a given input state under 
the action of a group. In other words, if the input state is the density matrix 
p £ B(H) on the Hilbert space H, then we want to find the best estimation of the 
states in the orbit 

O = { Pg = U g pU\ I g £ G} , (1) 

obtained by transforming the input state with the unitary representation {U g } of 
the group G. 

In general, the points of the orbit arc not in one-to-one correspondence with 
the elements of the group, since one can have p gi — p g2 even for different g\ and 
<72- However, in this paper we will consider for simplicity the case where the cor- 
respondence between the group and the orbit is one-to-one, since, a part from a 
technical complication in the notation, the extension of the results to the general 
case is straightforward. 

In the case of one-to-one correspondence between signal states and elements of 
the group, the problem of state estimation becomes equivalent to estimating the 
action of a black-box that performs an unknown unitary transformation drawn from 
the set {Ug \ g £ G}. From this point of view it is also important to find the optimal 
input states that allow to discriminate the action of the unitary operators {U g } in 
the best possible way. 

Whatever point of view we choose, we always need to estimate the value of 
a group parameter. In order to do this, the most general estimation strategy al- 
lowed by quantum mechanics is described by a Positive Operator Valued Measure 
(POVM) M, that associates to any estimated parameter g £ G a, positive semidef- 
inite operator M(g) on H, satisfying the normalization condition 

f &gM(g) = \, (2) 
Jg 
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d g being the normalized invariant Haar measure on the group ( J G d g = 1 
d(hg) = d(gh) = dg Vg,h e G). The probability density of estimating g 
when the true value of the parameter is g is then given by the usual Born rule: 

p(g\g) = Tr{ Pg M(g)} , (3) 

where p g = U g pU^. Note that here we are considering G as a continuous group only 
for fixing notation, nevertheless — here and all throughout the paper — G can have 
a finite number of elements, say |G|, and in this case we have simply to replace 
integrals with sums and dg with 1/|G|. 

In order to find an optimal estimation strategy, we need firstly to fix our opti- 
mality criterion. The most common way to do this is to weigh the estimation errors 
with some cost function 1,2 f(g,g), that assesses the cost of estimating g when the 
true value is g. Once the cost function is fixed, we can adopt two different set- 
tings for the optimization, the Bayesian and the frcqucntistic one. In the Bayesian 
setting, one assumes a prior distribution of the true values (which is usually uni- 
form) and then the optimal estimation is the one that minimizes the average cost, 
where the average is performed with respect to the probability distributions of both 
the estimated and the true values. On the other hand, in the frcqucntistic setting 
no prior distribution of the true values is assumed, and one performs a minimax 
optimization, i. e. minimizing the maximum (over all possible true values) of the 
average cost, where now the average is done just over the estimated values. 

2.2. Maximum likelihood approach 

Many different criteria can be used to define what is an optimal estimation, each 
of them corresponding to a different choice of the cost function in the optimiza- 
tion procedure. In general, the choice of a cost function depends on the particular 
problem at hand. For example, if we need to estimate a state, a natural cost is the 
opposite of the fidelity between the estimated state and the true one, while, if we 
arc interested in the estimation of a parameter, a more appropriate cost function 
would be the variance of the estimated values. 

In this paper, however, since we want to investigate general properties of covari- 
ant estimation, we seek a criterion that maintains a clear meaning in the largest 
number of different situations. The simplest approach that allows a general ana- 
lytical solution is given by the maximum likelihood criterion 1,2 , which corresponds 
to the maximization of the probability (probability density in the continuous case) 
that the estimated value of the unknown parameter actually coincides with its true 
value. In this case, the cost function is a Dirac-delta f(g,g) = —5(g,g) (Kronecker- 
delta in the finite case). 

For finite groups maximum likelihood is in some sense the most natural criterion. 
In fact, if we are trying to decide among a finite set of alternatives which is the true 
one, of course we would like to do this with the maximum probability of success. 
On the other hand, in the continuous case, the maximum likelihood approach can 
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still be interpreted as the maximization of the probability that the estimated value 
lies in a narrow neighborhood of the true one. 

2.3. Basic results from group theory 

Here we will recall some useful tools from group theory that we will exploit through- 
out the paper. 

Consider a finite dimensional Hilbert space Ti and a unitary (or, more generally, 
projective) representation R(G) = {U g G B(H) \ g G G} of a compact Lie group G. 
The Hilbert space can be decomposed into orthogonal subspaces in the following 

way 

H = ® C m - , (4) 

where the sum runs over the set S of irreducible representations of G that appear in 
the Clebsch-Gordan decomposition of R(G). The action of the group is irreducible 
in each representation space H^, while it is trivial in the multiplicity space C mfl , 
namely 

U a = ® Ug ® l ro( . , (5) 
tj.es 

where each representation {Ug } is irreducible, and 1^ denotes the identity oper- 
ator in a d— dimensional Hilbert space. Moreover, any operator O G B(Tt) in the 
commutant of R(G) — i.e. such that [O, U g ] — Vg G G — has the form 

O = Id, ® O u , (6) 
tj.es 

where d u is the dimension of Ji^ , and O a is a m M x m a complex matrix. In particular, 
the group average (A)g = f dg U g AU^ of a given operator A with respect to the 
invariant Haar measure dg is in the commutant of R(G), and has the form: 




where Tr-^fA] is a short notation for Tr^^ [P^APJ , denoting the orthogonal 
projector over the Hilbert subspacc TL a (g> C m " in the decomposition (4) of H. 
Here and throughout the paper we assume the normalization of the Haar measure: 
J G dff = l. 

Remark I: entanglement between representation spaces and multiplicity spaces. 
The choice of an orthonormal basis B M = {\<fi%) G C " | n = 1, . . . , m M } for a 
multiplicity space fixes a particular decomposition of the Hilbert space as a direct 
sum of irreducible subspaces: 



Ti.fi ® c m - = ®™=i n% , 



(8) 
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where H% = <g> \4>n)- In this picture, it is clear that m M is the number of differ- 
ent irreducible subspaces carrying the same representation /i, each of them having 
dimension d^. Moreover, with respect to the decomposition (4), any pure state 
|$) can be written as 

l*>=0c„|* M », (9) 
pes 

where l^)) is a bipartite state in <g> C m " and X^es l c ^l 2 = 1- With respect 
to the direct sum decomposition (8), the Schmidt number of such a state is the 
minimum number of subspaces carrying the same representation /i € S that are 
needed to decompose |^). 

Remark II: maximum number of equivalent representations in the decomposi- 
tion of a pure state. 

The Schmidt number of any bipartite state l^)) € <g> C" v is always less 

then or equal to = min{d M , m^}. This means that any pure state can be decom- 
posed using no more than fc M irreducible subspaces carrying the same representation 
/i S S. 

2.4. Covariant measurements 

Since the set of states to be estimated is invariant under the action of the group, 
there is no loss of generality in assuming a covariant POVM, i.e. a POVM satisfying 
the property M(hg) = UhM(g)U\ for any g, h <G G. In fact, it is well known that, for 
any possible POVM, there is always a covariant one with the same average cost 2 , 
this result holding both in the minimax approach and in the Bayesian approach 
with uniform prior distribution. 
A covariant POVM has the form 

M(g) = UgZUl , (10) 

where H is a positive scmidchnite operator. For covariant POVM's, exploiting the 
formula (7) for the group average, the normalization condition (2) can be translated 
into a simple set of conditions for the operator 5: 

Tr„ M [S]=^ U m(j . (11) 

In this way, the optimization of a covariant POVM is reduced to the optimization 
of a positive operator satisfying the constraints (11). 

2.5. Optimal measurements 

Here we derive for any given input state \^) <E H. the measurement that maximizes 
the probability (density) of estimating the true value of the unknown group pa- 
rameter g € G. Note that, due to covariance, this probability has the same value 
for any group element: p(g\g) — (^ISI^), according to Eqs. (3) and (10). In order 
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to find the POVM, it is convenient to express the input state in the form (9), and 
write each bipartite state \^>u}} in the Schmidt form: 

l*„» = E v^£hOO, (12) 

m— 1 

where r a < k u — min{d M , m M } is the Schmidt number, and \f n > V/i,m. We 
can now define the projection 

It projects onto the subspace spanned by the orbit of the input state, this 
subspace being also the smallest invariant subspace containing the input state. 

Clearly, the probability distribution of the outcomes of a covariant measurement 
M(g) = UgEU- performed on any state in the orbit depends only on the projection 
P-q, SPj. Therefore, to specify an optimal covariant POVM for the state \*S?), we 
need only to specify the operator PyEPy. All covariant POVM's corresponding to 
the same operator will be equally optimal. 

Theorem 1 (optimal POVM). For a pure input state \^f), the optimal covariant 
POVM in the maximum likelihood approach is given by 

P*EP* = \ri)(ri\, (14) 

where 

\v) = yfce'^M J2 ICJIC) • (15) 

PES m=l 

The value of the likelihood for the optimal POVM is 



P° pt (9\g) = I E M E V ] v 5 . (16) 

Proof. Using Schwartz inequality, the likelihood can be bounded as follows: 
p(g\g) = (*|H|*> 

<E M 



< 
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Moreover, exploiting the Schmidt form (12) and applying a second Schwartz in- 
equality, we obtain 



m,n—l 

\m=l / 

Finally, the positivity of S implies 

(CKCI 2 < (CI Tr„jS] |C) = d, , 

due to the normalization condition (11). By putting together these inequalities, we 
obtain the bound 

2 



P(g\g) < E M E V = P° pt (3l3) , 

\/xeS m=l / 

holding for any possible POVM. It is immediate to see that the covariant POVM 
given by (14,15) achieves the bound, hence it is optimal. ■ 

Remark I: uniqueness of the optimal POVM. 
Since the Theorem specifies the optimal POVM only in the subspace spanned by 
the orbit of the input space, it follows that the optimal POVM is unique if and only 
if the orbit spans the whole Hilbert space. If it is not the case, one can arbitrarily 
complete the POVM given by (14) to the entire Hilbert space. 

Remark II: square-root measurements. 
The optimal POVM in Eqs. (14) and (15) coincides with the so-called "square-root 
measurement" 16 . In fact, such a measurement has a POVM with \rf) = F~z\ip), 
with the "frame operator" F given by F = J d gU g \ip) (ip\U% . Using Eq. (7) one finds 
^ = © M M 2 E m ^]lM® ICXCI. and one can easily check that \ V ) = F~ . 

2.6. Optimal input states 

While in the previous paragraph we assumed the input state to be given, and we 
were mainly interested in the problem of state estimation, here we will focus our 
attention on the problem of estimating the action of a black box that performs 
an unknown unitary transformation drawn from a group. From this point of view, 
our aim is now to determine which are the states in the Hilbert space that allow 
to maximize the probability of successfully discriminating the unknown unitaries 

We will first show that the dimension of the subspace spanned by the orbit of 
the input state is always an upper bound for the likelihood, and that this bound can 
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always be achieved by using suitable input states. Then, the optimal input states 
will be the ones that maximize the dimension of the subspace spanned by the orbit. 

Lemma 1. Let d\z = dim Span{J7 g | l I') | g G G} be the dimension of the subspace 
spanned by the orbit of the input state. Then 

u<ES 

where r M is the Schmidt number of the bipartite state l^)) (we define r M = if 
\c a \ =0 in the decomposition (9) ). 

Proof. The subspace spanned by the orbit is the support of the frame operator 

the r.h.s. coming from Eq. (7). Using the Schmidt form (12) of each bipartite state 
l^)), it follows that the dimension of the support is dm = YlueS dur^. ■ 

Theorem 2 (relation between likelihood and dimension). For any pure 
input state \^f) eH, the following bound holds: 

p(g\g) < dm . (18) 

The bound is achieved if and only if the state has the form 

|*) = J= ® y/d^e»» |*,)>, (19) 

where e t6fi are arbitrary phase factors and G Ti^ ® C™ M is a bipartite 

state with Schmidt number r u and equal Schmidt coefficients (A^ = l/r a for any 
m = 1,.. . ,r u ). 

Proof. Exploiting Eq.(16), we have 

p(9\9)<P° pt (g\g) (20) 

(21) 




(22) 
(23) 

the inequalities (22) and (23) coming from Schwartz inequality and from the nor- 
malizations J2m=i — 1 an d \ c u I 2 = 1- Let us see when this bound is attained. 
Clearly, the equality in (20) holds if we use the optimal POVM of Theorem 1. On the 
other hand, the Schwartz inequality (22) becomes equality if and only if = l/r a 
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for any to = 1, . . . , r^. Finally, the last Schwartz inequality (23) becomes equality 
if and only if |c M | = \J~ i ^ L - The requirements |c^| = \J and = l/r M are 
satisfied only by states of the form (19). ■ 

We can now answer to the question which are the best input states for discrim- 
inating a group of unitaries. 

Theorem 3 (optimal input states). For any state p on H and for any POVM, 
the likelihood is bounded from above by the quantity 

L = J2 dpkn , (24) 
lies 

where fe M = min{d M , m^}. The bound is achieved by pure states of the form 

|*> = J= @ y/d^e"* \E,)) , (25) 

where e 10 ^ are arbitrary phase factors and \E^)) e TL^ ® C m f are arbitrary 
maximally entangled states. 

Proof. Since the likelihood £[p] — Tr[pS] is a linear functional of the input state, it 
is clear that the maximum likelihood over all possible states is achieved by a pure 
state. Therefore, according to Eq. (18), the maximum likelihood is given by the 
maximum of over all pure states. Since the Schmidt number r M cannot exceed 
fc p = maxjd^, to m }, we obtain that the maximum value is 

L = max{<2* | |*) e H} = d^k^ . 

According to Theorem (2), such a maximum is achieved by pure states of the form 
(25). ■ 

The results of Theorems 1,2, and 3 have some important consequences. 
Consequence I (each irreducible subspace contributes to the likeli- 
hood with its dimension) 

According to Eq. (25), the probability of successful discrimination is maximized 
by exploiting in the input state all the irreducible representations appearing in 
the Clebsch-Gordan decomposition of U g . Moreover, the contribution of each irre- 
ducible subspace to the likelihood is related to the dimension d^ by Eqs. (17), (18), 
and (24). In other words, the maximum likelihood approach allows to give a general 
quantitative formulation to the common heuristic argument that relates the quality 
of the estimation to the dimension of the subspace spanned by the orbit of the input 
state. From this point of view, the interpretation of the well known example 18 about 
the quantum information of two parallel vs anti-parallel spin 1/2 particles is clear: 
for parallel spins the input state lies completely in the triplet (symmetric) subspace, 
while for anti-parallel it has a nonzero component also onto the singlet. Evaluat- 
ing the likelihood with Eq. (16), we have indeed p(g\g)° pt = (1 + V3) 2 /2 « 3.73 
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for anti-parallel spins, instead of p(g\g)° pt — 3 for parallel ones. Notice, however, 
that the latter is not the optimal input state in the maximum likelihood approach, 

which instead has coefficients c M = \J-^-, with p = denoting the singlet and 



/i = 1 the triplet. The largest are given by r M = 1, whence c M = y namely 

Co = \ and ci = giving likelihood p(g\g)° pt = 4. It is possible to show 6 that 
this optimal input state can be chosen as a factorized state, such a state being the 
tensor product of two mutually unbiased spin states. 

Consequence II (key role of equivalent representations) 
The repeated use of equivalent representations is crucial for attaining the maxi- 
mum probability of successful discrimination. In fact, in order to achieve the upper 
bound (24) one necessarily needs to use the maximal amount of entanglement be- 
tween representation spaces and multiplicity spaces, corresponding to the maximum 
number of irreducible subspaces carrying the same representation p, for any p in 
the Clebsch-Gordan decomposition. 

Consequence III (maximization of the Holevo x-quantity) 
The optimal states in the maximum likelihood approach are those which maximize 
the Holevo x-quantity 7 , which in the group covariant case is defined as 

XG (p) = S (j d 9 UgpU^j - jdg S(U g pU%) , (26) 

S(p) = — Tr[ ( olog( ( o)] being the von Neumann entropy. In fact, for pure input states 
p = |^)(W|, the x-quantity is the entropy of the average state: xg(p) = S (( P )g)- 
Using Eq. (7), we have 

(P> G = M^^TrHj^M^I] . 

It is then easy to see that, for any pure state p = \\&)(SS?\, 

Xc(p)<logd*, (27) 

and that the bound is attained by states of the form (19). Finally, the maximum 
over all pure states is 

Xc(p) = logL, (28) 

achieved by states of the form (25). In this way, the likelihood is directly related to 
the x-quantity, providing an upper bound to the amount of classical information 
that can be extracted from the orbit of the input state. 



3. Internal vs external entanglement 

Up to now we looked for the optimal input states in the Hilbert space of the 
system undergoing the unknown group transformation. From this point of view, 
the entanglement between representation and multiplicity spaces was just a kind of 
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internal entanglement, between two virtual subsystems 11 with Hilbert spaces given 
by the representation and the multiplicity spaces, respectively. 

Suppose now that we can exploit an additional entangled resource, i.e. we can 
entangle the system that undergoes the unknown group transformation with an ad- 
ditional external system, which acts as a reference. In this case, we have to consider 
the tensor product Hilbert space H (g> Tin, where the group acts via the represen- 
tation {U' g — Ug® TL-ji | g € G}. From the point of view of the group structure, the 
only effect of the reference system is simply to increase the multiplicity of the ir- 
reducible representations. In fact, the Clebsch-Gordan decompositions of {U g } and 
{U' g } contain exactly the same irreducible representations, while the new multiplic- 
ities are = • d-ji, where d-R. = dimH-ji- Therefore, from the sole consideration 
of the decomposition of the Hilbert space, we obtain the following: 

Theorem 4 (use of external entanglement). Once = ■ du > for 
any irreducible representation fie S, any further increase of the dimension of the 
reference system is useless. 

In particular, if d^ > V/i G S, there is no need of entanglement with an external 
system. 



Proof. In the decomposition (9) of a pure state, the rank of any bipartite state 
cannot exceed k'^ = minjc^, m^}. Therefore, once m'^ > d^ for any /x, the orbit of 
any pure state can always be embedded in a subspace where all the multiplicities 
are equal to the dimensions. ■ 

In other words, once the saturation m'^ > d^ is reached, there is no need of 
increasing the dimension of the reference system. 

Corollary 1 (dimension of the reference system). The maximum dimension 
of an external system that is useful for estimation is 



d-n = max 



dfj_ 

m u 



/xeS , (29) 



where the "ceiling" \x~\ denotes the minimum integer greater than x. 

This mechanism of saturation of the multiplicities can be simply quantified in 
terms of the likelihood. In fact, the improvement coming from the external reference 
system can be evaluated using Theorem 3, yielding the value of the likelihood for 
the optimal input state: L' — E M es ^m^' where k'^ — minjd^, m^}. The upper 
bound that can be achieved with the use of a reference system as in Coollary 1 is 
then 

imax=^^ ■ (30) 

tj.es 



l 
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4. Generalization to infinite dimension and non-compact groups 
4.1. Compact groups in infinite dimension 

The main problem with infinite dimension comes from the fact that, in some cases, 
the optimal states of section 2.6 are not normalizable. From a physical point of 
view, this means that one has to approximate them with normalized states in some 
reasonable way, fixing additional constraints as, for example, the energy constraint. 
Clearly the best approximation depends on the particular problem at hand. 

In a similar way, the POVM elements P(g) = U g 5 in general are not 
operators. For example, the well known optimal POVM for the estimation of the 
phase of the radiation field is given by 

P(<f>) = |e(0))<e(0)| , (31) 

where |e(0)) = X^^Lo e%n ^ l n ) are ^ ne so - caue d Susskind-Glogower vectors 19 . 
Since such vectors are not normalizable, the POVM elements P((j>) are not op- 
erators acting in the Hilbcrt space Ji. For this reason, in infinite dimension one 
should substitute the positive operator P{g) with a positive form 7r s , defined by 
n g (|0), |^)) = MP(ffM- 

However, except for this technicality, all results of Section 2.5 concerning optimal 
POVM's are essentially valid in infinite dimension. 



4.2. Non-compact groups 

The generalization of our method to the case of non-compact groups is more in- 
volved than for compact groups in infinite dimension. Nevertheless, such a gener- 
alization is crucial for many physically meaningful cases, e. g. the estimation of 
displacement or of squeezing parameters in quantum optics. 

In the following, we will consider the case of unimodular groups, i.e. groups for 
which the left-invariant measure d^ g (d^/i^ = c\l g Vg,/i <G G) and the right- 
invariant one dug (dugh = dp g Vg,h e G) coincide. We will then define the 
invariant Haar measure as dg = d^ g = d^ g. Notice that, in the Bayesian point 
of view, it is no longer possible to assume the group parameters to be distributed 
according to such a measure, since the uniform measure over a non-compact group 
is non-normalizable. 

In general, for non-compact groups the irreducible representations contained in 
the Clebsch-Gordan decomposition may form a continuous set. To deal with such a 
situation one should replace in the Sec. 2.5 and 2.6 direct sums with direct integrals. 
For example, the decomposition of the Hilbert space (4) would rewrite 17 

e 

m(du) Ti^ (g> M.^ , (32) 

m(du) being a measure over the set of irreducible representations, and de- 
noting the multiplicity space. In the following, we will not carry on this rather 
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technical generalization, leaving the case of a direct integral of irreducible represen- 
tations only to a specific example (see next paragraph). We will instead consider the 
simplest case of group representations that can be decomposed in a discrete series 
of irreducible components. In other words, we will assume that it is still possible to 
write the Clebsch-Gordan decomposition 

lies 

where the set S is discrete. 

Finally, we require any irreducible representation {U^} in the Clebsch-Gordan 
series to be square summable, that, in the in case of unimodular groups, is equivalent 
to the property 

J dg KW| U g |^)| 2 < oo V |W>, I0m> e . (34) 

Under these hypotheses, the results of Theorems 1, 2, and 3 can be immediately 
extended to non-compact groups. In fact, in this case we can exploit a simple 
generalization of the formula (7) for the group average, which allows us the use all 
the results of of Sections 2.5 and 2.6 by just substituting the dimensions of the 
irreducible subspaces with their formal dimensions. 

Proposition 1. Let be {U g } a discrete series of square- summable representations 
of a unimodular group. Then the group average (A)g of a given operator A is given 
by 

(A) G =01„,® , (35) 

pes d » 
where the formal dimension d^ is defined as 

^-(/d.g|(Vvl^l^)l 2 ) 1 . (36) 
IVv) and being any two normalized states in H^. 

Proof. Since the group average (A)g of an operator is in the commutant of the 
representation (33) it has the form (A) G = © M 1h m ® A^, for some suitable 
operators A^ acting in the multiplicity space. Taking the expectation value with 
respect to a normalized vector <= we obtain A^ = (V> M | (A) G |^) = 

Ti Hti [A (Bn) G ], where = |Vv)(Vvl ® Now ' since tne S rou P average (B) G 

is in the commutant of {U g }, and since £ 7i M , we have (B) G = l/d^n^ <8>1ai 
for some constant d^. The constant d^ is simply evaluated by taking the expectation 
value of {B) G with respect to a normalized vector |(/> M )|a M ) e £g> M^. ■ 

Remark. The formal dimension of Eq. (36) is not a property of the sole Hilbcrt 
space Hp,, but also of the irreducible representation acting on it. Depending on the 
particular irreducible representations, the same Hilbert space may have different 
formal dimensions. 



I 



I 
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4.3. An application: two indentical and two conjugated coherent 
states 

Here we give two examples about the estimation ol coherent states of a harmonic 
oscillator. Both cases involve the Abelian group of displacements in the complex 
plane, with projective representation on infinite dimensional Hilbert space Ji given 
by the Weyl-Heisenberg group of unitary operators {D{a) = e aat - Q a | a g C}, 
where a) and a are creation and annihilation operators respectively. Since the group 
is Abelian, it is obviously unimodular, a translation-invariant measure being 
(here we put the constant it just for later convenience). 

In the first example (two identical coherent states) we will consider two identi- 
cal copies of an unknown coherent state, while in the second (conjugated coherent 
states) we will consider two coherent states with the same displacement in posi- 
tion and opposite displacement in momentum. Exploiting the method of maximum 
likelihood we will find in both cases the optimal POVM for the estimation of the 
unknown displacement. From the comparison between the sensitivities of the op- 
timal measurements in the two close analogy will emerge with the well 
known example by Gisin and Popescu about quantum information carried by par- 
allel and anti-parallel spins 18 . This analogy, already noticed in the study of the 
optimal "phase conjugation map" by Cerf and Iblisdir 20 , will be analyzed here in 
detail from the general point of view of group parameter estimation. 



4.3.1. Two identical coherent states 

Here we consider two harmonic oscillators prepared in the same unknown coherent 
state \a), a e C. In this case, the family of signal states is 

S = { \a)\a) eH 82 |«£C}, (37) 

and is obtained from the ground state \^f) — |0)|0). by the action of the two-fold 
tensor representation {D{ot)® 2 \ a e C}. The Clebsch-Gordan decomposition of 
such a representation can be easily obtained by using the relation 

D{af 2 = y f D(V2a) ®1V , (38) 

where V = exp —j(a\a 2 — a\a\) (ai and a 2 denoting annihilation operators for 
the first and the second oscillator respectively). This relation shows that — modulo 
a non-local change of basis in the Hilbert space — the two-fold tensor representa- 
tion is unitarily equivalent to a direct sum where the irreducible representation 
appears with infinite multiplicity. Such a representation is 
square-summable, and has the formal dimension 

\(0\ D(V2a) \0)\ 2 ^ =2, (39) 

given by Eq. (36). Moreover, according to Eq. (8), a possible decomposition of the 
tensor product Hilbert space into irreducible subspaces is given by any set of the 
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form 

H n = Vi H®\<M , (40) 

where {\4> n ) | n 6 0,1,...} is an ortho-normal basis for Ji. By taking the basis 
of eigenvectors of the number operator a^a, we immediately see that the input 
state |^) = |0)|0) completely lies in the irreducible subspace 7io- Denoting by 
P = (Wl <g> |0)(0|)V the projection onto 7Y , we have indeed Pol*) = |*>- Using 
Theorem 1, we have that for the state |^) the optimal- likelihood covariant POVM 
must have H such that PqZPq = \v){v\ with = \/2|0)|0), since here = 1 (sec 
Eq. (12)). Then, we have that any covariant POVM with P Z P = 2 (|0)(0|)® 2 is 
optimal for estimation of a. For example, we can take the POVM 

M(a) = 2 D{a)® 2 (V f V) D(a)^ 2 , (41) 

where the unitary V is defined as above, and 1 1)) is the vector 1 1)) = l n )l n )- 
It can be shown that this POVM corresponds to measuring the two commuting 
observables corresponding to the position of the first oscillator and the momentum 
of the second one. In this scheme, if the outcomes of the two measurements are 
qi and p 2 respectively, we simply declare that our estimate of the displacement is 
a = qi + ip 2 - 

A different POVM which is equally optimal is 

M(a) = 2 D(a)® 2 (V^ |0)(0| <g> 1 V) D(a)^ 2 . (42) 

In a quantum optical setup, this POVM corresponds to performing firstly an het- 
erodyne measurement on each oscillator, thus obtaining two different estimates a\ 
and «2 for the displacement, and then averaging them with equal weights. The final 
estimate is a = (" 1 + Q;2 ) . 

Although the two POVM's are different and correspond to two different exper- 
imental setups, they give rise to the same probability distribution when applied to 
coherent states. It is indeed straightforward to see that the probability density of 
estimating a when the true displacement is a is given in both cases by the Gaussian 

p(a\a) = 2 e - 2 l«-«l 2 (43) 

(normalized with respect to the invariant measure ^f L )- The value of the likelihood 
is p(a\a) — 2, according to our general formula (16). 

Remark : Improving the likelihood with squeezing. 
The maximum likelihood can be improved using the doubly-squeezed state 



|M> X ) = yVl-* 2 I>»|n> , (44) 

n=0 

where without loss of generality we choose x > (x < 1 for normalization) . Then, 
by applying Theorem 1, it is immediate to show that \rj) — ^/2V^\IJ) and to evaluate 
the likelihood of the optimal POVM as 

p(a\a) = 2 i±£ . (45) 
1 — x 



l 



l 
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Notice that for zero squeezing (x = 0) we retrieve the case of two identical coher- 
ent states, while for infinite squeezing (x — > 1~) the likelihood becomes infinite, 
according to the fact that the displaces states D(a)® 2 |4 , a; ^ 1 -) become orthogonal 
in the Dirac sense, allowing for an ideal estimation. 

4.3.2. Conjugated coherent states 

Now the family of signal states is 

S = {\a)\a*) | aeC} , (46) 

where complex conjugation is defined with respect to the basis {|n) | n = 0, 1, . . . }. 
These states are generated from the input state |\&) = |0)|0) by the action of the 
representation {D(a) ® D(a*) \ a e C}. 

Unfortunately, such a representation cannot be decomposed into a discrete 
Clebsch-Gordan series, due to the fact that all the unitaries in the representa- 
tion can be simultaneously diagonalized on a continuous set of non normalizable 
eigenvectors. In fact, for any vector of the form \D(j3))) ~ D((3) ® where 
is the vector = ^ n \n)\n), we have 

D{ct) ® D{a*) \D{13))) = e^-^f 3 \D{fi))) . (47) 

These vectors are orthogonal in the Dirac sense, namely ((D(a)\D(/3))) — Tr5 2 (a — 
(3). Therefore, any such vector can be regarded the basis of a one-dimensional 
irreducible subspace Hp. The multiplicity of any irreducible representation is one, 
and the Hilbert space can be decomposed as a direct integral 

n®n= \ —Hp . (48) 

Jc * 

In the same way as in (6), an operator O G B(H <g> Ti) in the commutant of the 
representation can be written as 

f® d 2 

0= o((3) , (49) 

Jc n 

where 1/3 = \D(/3)))((D(@)\ is the identity in Hp, and o(/3) is some scalar function. 

In this particular example it is easy to extend the results of Sections 2.5 and 
2.6 to the case of a direct integral of irreducible representations. In fact, using 
functional calculus we can generalize the formula (7) for the group average: 

Proposition 2. The average (A)c of an operator over the representation 
{D(a) ® D(a*) | a e C} is 

f® d 2 3 

(A) c = / — ^ 1/3 Tr n0 [A], (50) 
whereTT nfi [A} = ((D(f3)\ A \D(/3)))- 
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This expression for the group average is equivalent to that of Eq.(7) modulo the 
obvious substitutions: 

fm . f® d 2 (3 

U7^es Jc tt 

(51) 

^ -» dp = 1 V/3 e C 

The optimal POVM is obtained by Theorem 1 by making these substitutions. We 
just need to decompose the input state on the irreducible subspaces, i.e. 

|0>|0>= f* *A e-Wl* mm , (52) 

and then take the optimal POVM given by the operator S = \r])(r]\ in Eqs. (14) 
and (15), which in the present case becomes 

r® d 2 8 

\v)= -r- W)» ■ (53) 

Notice that, since the input state |0)|0) has nonzero components in all the irre- 
ducible subspaces, the optimal covariant POVM is now unique. Using such optimal 
POVM, 

M(a) = D(a) ® D{a*) \r)){rj\ £>(a)t ® D(a*)* , (54) 

the probability density of estimating a when the true displacement is a can be 
calculated to be the Gaussian 

p(a\a) = 4 e - 4 l«-«l 2 (55) 

(normalized with respect to the invariant measure — -). 

Notice that the value of the likelihood p(a\a) = 4 could also be calculated 
directly using the formula (16), which now reads 

p(a\a)= ^ e- | ' 3|2/2 ) 2 = 4. (56) 

Comparing the optimal distribution (55) for two conjugated coherent states with 
the corresponding one for two identical coherent states (43) we can observe that 
the variance has been reduced by one half, while the likelihood has become twice. 
It is interesting to note the remarkable analogy between this example in continuous 
variables and the example by Gisin and Popescu 18 about the quantum information 
encoded into a pair of parallel and anti-parallel spins. In fact, in the case of spins the 
authors stressed that, quite counter-intuitively, while two classical arrows pointing 
in opposite direction carry the same information, in a quantum mechanical setup 
two anti-parallel spins carry more information than two parallel ones. In the same 
way, in the continuous variables context, while classically two conjugated points 
a and a* in the phase space carry the same information (such information being 
the couple of real numbers (x,p) ), quantum mechanically two conjugated coherent 
states carry more information than two identical ones. The analogy is even closer, 



I 



I 
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since for spin-i particles the "spin- flip" operation is unitarily equivalent to the com- 
plex conjugation, whence we can regard also the example of spins as a comparison 
between pairs of identical states and pairs of conjugated states. 

It is important to stress that the group theoretical analysis and the maximum 
likelihood approach provide in both cases also a clear explanation of the mechanism 
generating the asymmetry between pairs of identical and conjugated states. In fact, 
the whole orbit of a pair of identical coherent states (or spins states) lies just in one 
irreducible subspace of the Hilbert space, while the orbit of a pair of conjugated 
coherent states (spin states) covers all irreducible subspaces. According to formula 
(16), the likelihood in the case of conjugated states is higher than the likelihood for 
identical states both in the case of coherent and spin states, corresponding to an 
enhancement of the probability of successful discrimination. 
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